Cost Drivers of a Parametric Cost Estimation Model for Data Mining Projects (DMCOMO)

نویسندگان

  • Óscar Marbán
  • Antonio de Amescua Seco
  • Juan Jose Cuadrado-Gallego
  • Luis García
چکیده

Data Mining is a research line that began in 1980 in order to find the knowledge that is hidden in the data that organizations are storing in a daily basis. This knowledge supports the decision-making processes in organizations. As a consequence companies of every kind have been developing data mining projects since the term appeared. However, there is no way to estimate this kind of projects. Although there are many references to Data Mining algorithms in the bibliography, not many authors have dealt the problem from Software Engineering point of view. CRISP-DM is a model process, from Software Engineering point of view, that appeared in 2000. CRISP-DM is the first standard of Data Mining projects development. In the standard of software development model process, e.g. ISO 12207 and IEEE 1074, processes and tasks are proposed similar to those in CRISP-DM model. Nevertheless, in software development a lot of methods are described to estimate the costs of project development (SLIM, SEER-SEM, PRICE-S and COCOMO). These methods are not appropriate in the case of Data Mining projects because in Data Mining software development is not the first goal. Some methods have been proposed to estimate some phases of a Data Mining project but there is no method to estimate the global cost of a generic Data Mining project. As a consequence, in this paper we propose the cost driver of a parametric estimation method for Data Mining projects.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A cost model to estimate the effort of data mining projects (DMCoMo)

CRISP-DM is the standard to develop Data Mining projects. CRISP-DM proposes processes and tasks that you have to carry out to develop a Data Mining project. A task proposed by CRISP-DM is the cost estimation of the Data

متن کامل

A New Cost Model for Estimation of Open Pit Copper Mine Capital Expenditure

One of the most important issues in all stages of mining study is capital cost estimation. Determination of capital expenditure is a challenging issue for mine designers. In recent decade, quite a few number of studies have focused on proposing estimation models to predict mining capital cost. However, these efforts have not achieved to a predictor model with reliable range of error. Both of ov...

متن کامل

Presented a method for estimating the cost of software using PCA to reduce the size and with the help of data mining

  These days, data mining one of the most significant issues. One field data mining is a mixture of computer science and statistics which is considerably limited due to increase in digital data and growth of computational power of computer. One of the domains of data mining is the software cost estimation category. In this article, classifying techniques of learning algorithm of machine ...

متن کامل

A New Optimized Hybrid Model Based On COCOMO to Increase the Accuracy of Software Cost Estimation

The literature review shows software development projects often neither meet time deadlines, nor run within the allocated budgets. One common reason can be the inaccurate cost estimation process, although several approaches have been proposed in this field. Recent research studies suggest that in order to increase the accuracy of this process, estimation models have to be revised. The Construct...

متن کامل

A New Empirical Model to Increase the Accuracy of Software Cost Estimation (TECHNICAL NOTE)

We can say a software project is successful when it is delivered on time, within the budget and maintaining the required quality. However, nowadays software cost estimation is a critical issue for the advance software industry. As the modern software’s behaves dynamically so estimation of the effort and cost is significantly difficult. Since last 30 years, more than 20 models are already develo...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2002